Abstract

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is a recently
identified lineage B coronavirus that has caused a rapid worldwide outbreak of
coronavirus disease 2019 (COVID-19). Despite being genetically close to SARS-CoV,
SARS-CoV-2 seems to possess enhanced infectivity and subtly different clinical
features, which may hamper the early screening of suspected patients as well as the
control of virus transmission. Unfortunately, there are few tools to predict the
potential target organ damage and possible clinical manifestations caused by this
novel coronavirus. To address this problem, we investigated the potential host cell
entry mechanisms of SARS-CoV-2 through bioinformatics. Using online single-cell
sequencing datasets, we analyzed the expression in host cells of the major receptor
that mediates virus entry, angiotensin converting enzyme 2 (ACE2), and of its
co-expressed membrane endopeptidases. The results indicated that the differential
expression of ADAM10 and ADAM17 might contribute to ACE2 shedding and affect
membrane ACE2 abundance. We further confirmed a recently reported putative furin
cleavage site in the spike protein of SARS-CoV-2, which may facilitate virus-cell
fusion. Based on these findings, we developed a novel approach that comprehensively
analyzes virus receptor expression, ACE2 shedding, membrane fusion activity, virus
uptake and virus replication to evaluate the infectivity of SARS-CoV-2 in different
human organs. Our results indicate that, in addition to airway epithelia, cardiac
tissue and enteric canals are susceptible to SARS-CoV-2 as well.
Introduction

In December 2019, a novel coronavirus (SARS-CoV-2) epidemic emerged in
Wuhan, Hubei, China and rapidly spread globally [1]. As of 26 February 2020, there
were 78,196 confirmed cases and 2,718 fatalities across all provinces of China, posing
a serious challenge to public health. Like severe acute respiratory syndrome
(SARS)-CoV, SARS-CoV-2 was classified as a group 2B CoV and is genetically related
to SARS-CoV, with ~80% nucleotide identity [2]. Although patients with
SARS-CoV-2-induced pneumonia exhibited typical symptoms of viral pneumonia, including
cough, fever, myalgia, headache and even acute respiratory distress syndrome (ARDS),
a substantial portion of patients experienced atypical symptoms such as diarrhea
(gastrointestinal discomfort), acute kidney injury, liver damage and acute cardiac
injury [3], which complicates clinical intervention and long-term rehabilitation.
Here, we speculated that an in-depth analysis of virus entry into host cells would be
crucial to explain the different sensitivities of various cell types to SARS-CoV-2
and the above-mentioned diversity of clinical manifestations.

Coronaviruses have been reported to enter host cells by two distinct routes [4]. The
first route is the well-known receptor-mediated endocytosis. The surface spike (S)
glycoprotein is widely accepted as the major determinant of coronavirus entry, with
the S1 domain mediating receptor binding and the S2 domain responsible for virus-cell
membrane fusion [5]. Upon binding to the corresponding receptor on the host cell
surface, the S protein undergoes subtle conformational changes that result in uptake
by endosomes for further proteolytic digestion [6-7]. In the case of SARS-CoV,
numerous studies have demonstrated that the angiotensin converting enzyme 2 (ACE2)
receptor on the surface of human cells can be utilized to facilitate virus entry
[8-9]. Given the close relationship of SARS-CoV-2 to SARS-CoV, emerging evidence
indicates that the S1 subunit of SARS-CoV-2 may also bind the ACE2 receptor in lung,
renal and cardiac tissue, liver, testis and intestinal epithelia to mediate virus
fusion and lead to the related symptoms [10-13]. Furthermore, ACE2 can be shed from
the surface of human airway epithelia, the site of SARS-CoV infection, by the a
disintegrin and metalloproteinase (ADAM) family of proteases, which contributes to
ACE2 downregulation [14-15]. This means that the expression pattern of ACE2 in normal
tissues may not directly correlate with their susceptibility to SARS-CoV-2.

Alternatively, the S protein can be cleaved by an S-activating protease co-expressed
with the host cell receptor, thereby inducing direct fusion of the viral and cellular
membranes [16]. Millet et al. showed that the S protein of Middle East respiratory
syndrome coronavirus (MERS-CoV) can be cleaved and activated by furin, a
ubiquitously expressed endopeptidase, a step that is essential for MERS-CoV infection
[17]. Additionally, transmembrane protease, serine 2 (TMPRSS2) was reported to
directly cleave and activate SARS-S, enabling SARS-S-driven virus-cell fusion
[18-19]. However, few studies have investigated the endopeptidases potentially
involved in the cleavage of the SARS-CoV-2 S protein, which may underlie the
infection of multiple organs.


Based on the above, the distribution of ACE2 may not strictly correlate with the
cell tropism of SARS-CoV-2. Other influencing factors should be taken into
consideration to predict potential tissue damage besides pulmonary injury. In the
current study, we analyzed online single-cell sequencing datasets to reveal the
expression pattern of ACE2 and of ACE2 co-expressed endopeptidases, in order to
comprehensively assess the infective efficiency of SARS-CoV-2 in different organs.
Furthermore, we emphasized the combined analysis of ACE2 expression, FURIN
expression and ACE2 shedding to explain the various clinical manifestations in
different target organs after SARS-CoV-2 infection.


Methods 
Public scRNA-seq dataset acquisition 

The gene expression and cell type annotation of single-cell RNA-sequencing
(scRNA-seq) data from normal human lung, nose, heart and enteric canal were
downloaded from the Gene Expression Omnibus (GSE122960, GSE121600,
GSE121893, GSE125970, GSE119969).


scRNA-seq data processing 

Seurat (version 2.3.2) and Scanpy (1.4.4.post1) [20-21] were used to perform
the scRNA-seq data processing following the standard procedures (described at
https://scanpy-tutorials.readthedocs.io/en/latest/pbmc3k.html and
https://satijalab.org/seurat/v3.1/pbmc3k_tutorial.html).


Filtering of ACE2 co-expressing genes 

All original data were normalized with Scanpy
(https://scanpy-tutorials.readthedocs.io/en/latest/pbmc3k.html). Cell count matrices
were normalized to make them comparable across all qualified cells. The resulting
normalized matrices were used to find genes co-expressed with ACE2. Cells with ACE2
expression >0 were selected from the normalized gene-cell matrix. Among these ACE2
positive cells, a gene was excluded if the number of cells expressing it did not
exceed 10 percent of the total number of ACE2 positive cells. The remaining genes
were regarded as ACE2 co-expressed genes. The signature scores were calculated
based on the method described by A Wallrapp et al. [22]. The violin plots
representing the gene expression and score values were drawn with seaborn
(https://seaborn.pydata.org/generated/seaborn.violinplot.html?highlight=violinplot#seaborn.violinplot)
and matplotlib
(https://matplotlib.org/api/_as_gen/matplotlib.pyplot.violinplot.html?highlight=violinplot#matplotlib.pyplot.violinplot).
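The filtering rule described above can be sketched in plain Python. This is a minimal illustration rather than the pipeline used in the study: the function names, the dictionary-based matrix layout, the `target_sum` value and the toy counts are all assumptions made for the example, and the normalization shown (library-size scaling followed by log1p) only mimics the kind of step performed by Scanpy.

```python
import math

def normalize_counts(counts, target_sum=1e4):
    """Scale every cell to the same total count, then apply log1p.

    `counts` maps cell id -> {gene: raw count}. The target_sum value is
    an assumption for this sketch, not a parameter reported in the text.
    """
    normalized = {}
    for cell, genes in counts.items():
        total = sum(genes.values())
        if total == 0:
            continue  # drop empty cells instead of dividing by zero
        normalized[cell] = {
            gene: math.log1p(count / total * target_sum)
            for gene, count in genes.items()
        }
    return normalized

def coexpressed_genes(expr, anchor="ACE2", min_fraction=0.10):
    """Return genes expressed in more than min_fraction of anchor-positive cells."""
    positive = [genes for genes in expr.values() if genes.get(anchor, 0) > 0]
    if not positive:
        return []
    tally = {}
    for genes in positive:
        for gene, value in genes.items():
            if gene != anchor and value > 0:
                tally[gene] = tally.get(gene, 0) + 1
    n_positive = len(positive)
    return sorted(g for g, k in tally.items() if k / n_positive > min_fraction)

# Hypothetical toy counts (not data from the study).
toy_counts = {
    "cell1": {"ACE2": 2, "TMPRSS2": 5, "FURIN": 0},
    "cell2": {"ACE2": 1, "TMPRSS2": 0, "FURIN": 3},
    "cell3": {"ACE2": 0, "TMPRSS2": 4, "FURIN": 1},
    "cell4": {"ACE2": 3, "TMPRSS2": 1, "GENE_X": 0},
}
toy_expr = normalize_counts(toy_counts)
toy_hits = coexpressed_genes(toy_expr)
```

In the toy data three cells are ACE2 positive; TMPRSS2 is detected in two of them and FURIN in one, so both pass the 10 percent cut-off, while GENE_X is never detected and is dropped.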


Sequencing analysis of virus spike protein 

The complete genome and protein sequences of SARS-CoV-2 (NC_045512.2) and
SARS-CoV (NC_004718.3) were acquired from GenBank. The protein sequences of
their spike proteins were extracted for complete alignment with the software
Clustal X2 [23]. The secondary structure prediction of the spike protein of
SARS-CoV-2 was performed with the online tool SOPMA
(https://npsa-prabi.ibcp.fr/cgi-bin/npsa_automat.pl?page=npsa_sopma.html).
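Once two sequences have been aligned, their similarity can be summarized as a percent identity over the aligned, non-gap columns. The helper below is a minimal sketch of that calculation; the function name and the two short toy strings are hypothetical and are not taken from the spike alignment, and other identity definitions (for example, counting gap columns in the denominator) would give different values.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of identical residues over columns where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # gap columns are ignored in this definition
        compared += 1
        if a == b:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0

# Hypothetical aligned toy strings, for illustration only.
toy_identity = percent_identity("ACDE-GHK", "ACDFWGHR")
```

For the toy pair, seven columns are compared and five of them match.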


Results 
Candidate endopeptidases related to virus entry 

Alveolar type II cells of the lung were included in our study. After filtering the
ACE2 co-expressed genes, eight candidate endopeptidases related to virus entry were
selected according to the selection flow chart (Fig 1): ADAM10, ADAM17,
FURIN, SPPL2B, TMPRSS2, HPN, CAPN1 and CAPNS1. All are located on the cell
membrane. ADAM10 and ADAM17 were reported to cleave ACE2, causing it to be shed
from the cell surface [14-15]. The cleavage activity of SPPL2B is similar to that of
FURIN [24]. FURIN is a key factor for viral entry of human immunodeficiency virus
type-1 (HIV-1) through cleavage of its envelope glycoprotein [25]. In addition, FURIN
can proteolytically activate the membrane fusion activity of influenza viruses by
cleaving their hemagglutinin [26]. TMPRSS2 and HPN both belong to the serine protease
family containing a type II transmembrane domain. They were reported to be involved
in the viral entry of SARS-CoV, Sendai virus (SeV), human metapneumovirus (HMPV),
human parainfluenza viruses (HPIV) and influenza A virus [4, 27-29]. The products of
the genes CAPN1 and CAPNS1 form micromolar calpain (μ-calpain), which was regarded
as an important factor in the replication of SARS-CoV [30]. Hence, these
candidates were carried forward to examine their expression in the target organs.


The putative furin cleavage site in the spike protein of SARS-CoV-2 

As mentioned above, six of our candidate genes, all except FURIN and SPPL2B, have
been reported to be related to the host cell entry of SARS-CoV. We therefore asked
whether FURIN would contribute to the viral entry of SARS-CoV-2. To test this
hypothesis, we analyzed the spike protein sequence of SARS-CoV-2. The results
indicated that residues 682-685 contain the minimal furin cleavage motif R-X-X-R
[31]. This region was highly similar to the S1/S2 cleavage site of SARS-CoV (Fig
2A). Secondary structure prediction showed that this putative furin cleavage
site is located in a random coil region, which is the preferred target of
endopeptidases (Fig 2B). Hence, we speculated that FURIN might proteolytically
activate the membrane fusion activity of SARS-CoV-2 by cleaving the spike protein.
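A scan for the minimal R-X-X-R motif can be sketched with a regular expression. The snippet is illustrative only: the function name is hypothetical, and the short fragment and its assumed starting residue number (678) are supplied for the example rather than read from the GenBank record, so a real analysis should load the full spike sequence of NC_045512.2 instead.

```python
import re

# Lookahead pattern so that overlapping R-X-X-R occurrences are all reported.
FURIN_MINIMAL_MOTIF = re.compile(r"(?=(R..R))")

def find_furin_motifs(sequence, first_residue=1):
    """Return (start, end, motif) tuples using 1-based residue numbering.

    `first_residue` is the residue number of sequence[0]; it lets a
    fragment be reported in full-length protein coordinates.
    """
    hits = []
    for match in FURIN_MINIMAL_MOTIF.finditer(sequence.upper()):
        start = first_residue + match.start()
        hits.append((start, start + 3, match.group(1)))
    return hits

# Illustrative fragment around the S1/S2 boundary; the fragment and the
# offset 678 are assumptions made for this sketch.
fragment = "TNSPRRARSVAS"
motif_hits = find_furin_motifs(fragment, first_residue=678)
# motif_hits == [(682, 685, 'RRAR')]
```

Because R-X-X-R is only the minimal motif, a hit from such a scan marks a candidate site and does not by itself show that furin cleaves there.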


The expression of candidate endopeptidases in target organs 

Epithelial cells from nasal brushing samples and nasal turbinate,
cardiomyocytes and endothelial cells of the heart, and epithelial cells of the
enteric canal were included in our study. As shown in Fig 3, ACE2, TMPRSS2
and CAPNS1 were abundantly expressed in the lung, and FURIN and HPN were moderately
expressed. The expression of ACE2 and FURIN in the nose was slightly lower than
that in the lung. TMPRSS2 expression in the nose was significantly lower than that in
the lung. These results indicate that SARS-CoV-2 could utilize ACE2 and TMPRSS2
to infect airway epithelia, which is similar to the infection mechanism of SARS-CoV.
Interestingly, the expression of CAPN1 and CAPNS1 in the nose was significantly
higher than that in the lung, indicating that the virus would replicate rapidly
once it entered the nasal epithelial cells and that SARS-CoV-2 might infect nasal
epithelia more easily than SARS-CoV.

Notably, FURIN expression in cardiac tissue was moderate, similar to
that in the nose. This suggests that SARS-CoV-2 may be able to infect cardiomyocytes
and cause cardiac damage. In contrast, SARS-CoV is unlikely to utilize furin to enter
cardiomyocytes because it lacks a furin cleavage site. Additionally, TMPRSS2, a
well-known endopeptidase capable of cleaving and activating the SARS-S protein, was
not detected in cardiac tissue. Thus, we speculated that the lack of
endopeptidase-mediated direct virus-cell fusion may partially explain why SARS-CoV
rarely causes cardiac damage, despite the abundant ACE2 expression in cardiomyocytes.

In addition, ADAM10 and ADAM17 were abundantly expressed in the epithelial cells of
the enteric canal. These results suggested that the actual amount of membrane
ACE2 would be limited by shedding, even though ACE2 expression was abundant in
normal enterocytes. This finding may account for the relatively milder
gastrointestinal discomfort of COVID-19 patients compared with their
respiratory symptoms [3].

To comprehensively assess the efficiency of viral entry into the target organs,
we developed a method to calculate signature scores for virus receptor, ACE2
shedding, membrane fusion activity, virus uptake and virus replication. The results
are shown in Fig 4.
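To make the idea of a per-cell signature score concrete, the sketch below computes the mean expression of a gene set minus the mean expression of a background set in the same cell. This is a deliberately simplified stand-in and not the procedure of Wallrapp et al. [22] used in the study; the function name, the default choice of background (all remaining genes in the cell) and the toy expression values are assumptions. Only the ADAM10/ADAM17 pairing for ACE2 shedding is taken from the text.

```python
from statistics import mean

def signature_score(cell_expr, signature, background=None):
    """Mean expression of `signature` genes minus mean of `background` genes.

    `cell_expr` maps gene -> normalized expression for one cell. Genes
    missing from the cell are treated as not expressed (0.0).
    """
    if not signature:
        raise ValueError("signature must contain at least one gene")
    if background is None:
        # Assumption for this sketch: every other gene in the cell is background.
        background = [gene for gene in cell_expr if gene not in signature]
    signature_mean = mean(cell_expr.get(gene, 0.0) for gene in signature)
    background_mean = mean(cell_expr.get(g, 0.0) for g in background) if background else 0.0
    return signature_mean - background_mean

# Hypothetical normalized expression values for one cell.
toy_cell = {"ADAM10": 2.0, "ADAM17": 1.0, "GENE_A": 0.5, "GENE_B": 0.5}
shedding_score = signature_score(toy_cell, ["ADAM10", "ADAM17"])
```

Scores computed this way for each cell could then be grouped by organ and displayed as violin plots, one panel per category.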


Discussion 

The rapid spread of SARS-CoV-2 and the large number of infected people
quickly overwhelmed the public health services in Hubei Province, and the outbreak
has become a serious public health problem [32]. Besides the common respiratory
symptoms, many COVID-19 patients experience gastrointestinal symptoms or cardiac and
renal injury, and some may even be asymptomatic [3]. The diversity of clinical
manifestations complicates diagnosis and hinders individualized treatment. Therefore,
studies on the mechanisms of cell entry could help us comprehensively understand the
pathogenesis of SARS-CoV-2 in affected tissues.

The cell surface receptor ACE2 has been proven to facilitate SARS-CoV entry into
host cells, and recent studies reveal the crucial role of ACE2 in receptor-mediated
SARS-CoV-2 infection as well [33]. However, the expression of endopeptidases may
provide an infection pathway that is 100-fold more efficient than receptor-mediated
endocytosis [34]. In this study, we explored the expression of the screened
endopeptidases co-expressed with ACE2 in various target cells using online
single-cell sequencing datasets. Besides ACE2, the abundant expression of furin and
other endopeptidases (such as TMPRSS2, CAPN1 and CAPNS1) in the nasal epithelia
provided the prerequisites for SARS-CoV-2 infection and rapid replication in the
upper respiratory tract. These findings are in accordance with the recent report of
SARS-CoV-2 load in upper respiratory specimens (nasal and throat swabs) [1],
suggesting augmented transmission and a need for isolation strategies different from
those used for SARS-CoV.


Notably, we confirmed by bioinformatics a potential furin cleavage site at the S1/S2
interface of SARS-CoV-2, which is absent in the S protein of SARS-CoV. This
difference between the S proteins of SARS-CoV and SARS-CoV-2 was also reported in
another recent study from China [35]. Hence, we speculated that the presence of furin
during the course of SARS-CoV-2 infection may result in some clinical manifestations
distinct from those of SARS-CoV infection. In the case of cardiac tissue, ACE2
expression was detected in cardiomyocytes, providing the prerequisite for SARS-CoV
and SARS-CoV-2 infection. Correspondingly, acute cardiac injury has been reported in
SARS-CoV-2-infected patients, whereas it was rarely found after SARS-CoV infection
[3]. We suppose that furin-mediated cardiomyocyte entry may explain the cardiac
damage in some SARS-CoV-2 patients. Follow-up and evaluation of cardiac function may
be necessary in SARS-CoV-2-infected patients, especially elderly patients.
Whether treatment with furin inhibitors could benefit these SARS-CoV-2-infected
patients deserves further research.

Another major point of this study is the global analysis of the susceptibility of
potential target organs to SARS-CoV-2 infection. Since CoVs commonly infect the
respiratory and gastrointestinal systems [36], we took enteric tissue as an example
in the current study. Abundant ACE2 expression was detected in enterocytes from
colon, intestine and rectum, corresponding to the enteric symptoms of
SARS-CoV-2-infected patients. Meanwhile, the ACE2 sheddases ADAM10 and ADAM17 were
also highly expressed in the enteric canals, although the expression of ADAMs was
relatively low in the ileum. These findings suggest that the actual membrane ACE2 in
the ileal epithelia may be higher than that in other enteric canals because of the
lower ACE2 shedding rate. Moreover, considering virus receptor expression, membrane
fusion activity, virus uptake and virus replication, the signature score of the ileum
was much higher than that of the colon or rectum. Thus, we speculate that the ileum
might be more susceptible to SARS-CoV-2 gastrointestinal infection. Consistent with a
recent study [12], this study calls for clinicians to pay attention to fecal-oral
transmission and enteric damage, especially ileal abnormalities, in COVID-19 patients.

In conclusion, our results highlight the potential role of endopeptidases, such as
the ACE2 sheddases and furin, in the entry of SARS-CoV-2 into target cells.
Additionally, we provide a novel method to assess the sensitivities of different
target organs to SARS-CoV-2 infection, which may benefit future clinical care.
Figure legends

Figure 1. Candidate selection flow chart 

Figure 2. Putative furin cleavage site 

Figure 2-A. The putative furin cleavage site is located in a region similar to the
S1/S2 cleavage site of SARS-CoV.

Figure 2-B. The putative furin cleavage site was predicted to lie in a random coil
region.

Figure 3. The expression of selected endopeptidases in the target organs 

Figure 4. Signature scores for viral entry in the target organs

Figure 4-A. The signature score of virus receptor 

Figure 4-B. The signature score of ACE2 shedding 

Figure 4-C. The signature score of membrane fusion activity 

Figure 4-D. The signature score of virus uptake

Figure 4-E. The signature score of virus replication